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Abstract-In  this  paper,  a  new  method  to  model  ECG 
signals  by  means  of  "Predefined  Personalized  Signature 
and  Envelope  Functions"  is  presented.  ECG  signals  are 
somewhat  unique  to  a  person.  Moreover,  it  presents 
quasi-stationary  behavior.  Therefore  in  this  work,  on  a 
frame  basis,  personal  ECG  signals  Xft)  is  modeled  by  the 
form  of  X'(t)  ~  Ci(p,(t)  aft).  In  this  model,  (pft)  is  defined  as 
the  Personalized  .Signature  Function  (PSF);  Oi(t)  is 
referred  to  as  Personalized  Envelope  Function  (PEF)  and 
C,  is  called  the  Frame-Scaling  Coefficient  (FSC).  It  has 
been  demonstrated  that  for  each  person,  the  sets 
<5>={(pk(t)}  and  A  ={a,ft)}  constitute  a  "Predefined 
Personalized  Functional  Bases  or  Banks  (PPFB)"  to 
describe  any  measured  ECG  signal.  Almost  optimum 
forms  of  (PPFB),  namely  {aft)},  {(pft)}  pairs  are 
generated  in  the  Least  Mean  Square  (LMS)  sense.  Thus, 
ECG  signal  for  each  frame  is  described  in  terms  of  the 
two  indices  "R"  and  "F"  of  PPFB  and  the  frame-scaling 
coefficient  C,.  It  has  been  shown  that  the  new  method  of 
modeling  provides  significant  data  compression. 
Furthermore,  once  PPFB  are  stored  on  each 
communication  node,  transmission  of  ECG  signals 
reduces  to  the  transmission  of  indexes  "F"  and  "F"  of 
|  a,(t),tpk(t)\  pairs  and  the  coefficients  C„  which  also  result 
in  considerable  saving  in  the  transmission  band. 

Keywords  -  Models,  Compression,  Transmission  of  ECG 
Signals 

I.  Introduction 

Biomedical  signals  are  used  widespreadly  in  diagnosis  and 
therapy  of  many  diseases.  The  processing,  storing  and 
transfer  of  biological  signals  is  gaining  ever  more  importance 
in  today’s  information  era.  However,  since  such  signals 
comprise  huge  amounts  of  data,  the  storage,  transfer  and 
reconstruction  of  biological  signals  create  certain  limitations. 
One  way  to  overcome  this  problem  is  the  compression  of  the 
signals,  provided  that  the  information  covered  by  the  signal  is 
not  lost.  Moreover,  such  signals  have  to  be  repeatedly 
received  and  evaluated  during  the  course  of  the  illness  in 
order  to  verify  the  diagnosis,  determine  the  treatment 
method(s)  and  allow  follow  up  of  the  therapy  to  avoid 
abnormal  patterns  and  complications.  Therefore,  compression 
of  signals  with  acceptable  loss  is  inevitable. 

Diagnosis,  compression  and  speedy  transmission  of  the 
ECG  signals  may  be  achieved  by  means  of  appropriate 
models  with  least  number  of  parameters.  In  this  case,  the 
measured  ECG  data  may  be  described  in  terms  of  the 
parameters  of  the  selected  model.  Usually,  linear  models  are 


preferred  for  the  sake  of  simplicity.  In  the  existing  literature, 
efforts  have  been  made  to  represent  the  ECG  signals  in  terms 
of  the  linear  combination  of  somewhat  orthogonal  functions 
such  as  complex  exponentials,  trigonometric  functions, 
wavelets  or  pre-selected,  signature  functions,  which  best 
describe  the  signal  [1-7].  For  example,  in  [5-6],  over  a  finite 
time  interval,  limited  number  of  ad-hoc  sinusoidal  waveforms 
with  arbitrary  frequencies  is  selected  as  signature  functions. 
Then,  these  forms  are  orthogonalized  to  constitute  a 
functional  base  to  represent  the  original  ECG  signal. 
Eventually,  the  ECG  data  is  approximated  by  the  weighted 
sum  of  these,  so  called  signature  base  functions  obtained 
from  the  previous  step.  In  these  techniques  however,  the 
optimum  form  of  the  signature  base  functions  and  the 
expected  minimum  numbers  of  terms  in  the  summation,  are 
in  question.  The  difficulties  of  the  above-mentioned 
techniques  have  been  overcome  with  the  novel  modeling 
method  proposed  in  this  paper.  The  new  method  is  based  on 
the  generation  of  the  so  called  the  "Predefmed-Personalized 
Functional  Bases  or  Banks  -  PPFB".  The  PPFB  consist  of  the 
two  major  functional  banks,  namely  the  "Envelope  Functions 
Bank  -  EFB"  and  the  "Signature  Functions  Bank  -  SFB".  The 
measured  ECG  data  is  then;  modeled  as  the  multiplication  of 
two  appropriate  functions,  which  are  retrieved  within  a 
constant  from  the  above-mentioned  banks  respectively.  In  the 
following  sections  first  the  new  technique  to  model  the 
measured  ECG  data  is  introduced.  Then,  SFB,  EFB  and 
reconstruction  algorithms  are  presented.  Finally,  an  example 
is  given  to  exhibit  the  utilization  of  the  new  modeling 
technique. 

II.  New  METHOD 

ECG  signals  are  somewhat  unique  to  a  person.  Moreover, 
on  the  personnel  basis,  it  presents  a  quasi-stationary  like 
behavior.  Therefore,  it  would  be  appropriate  to  extract  the 
statistical  features  of  the  ECG  signals  over  a  reasonable 
length  of  time.  For  the  sake  of  practicality,  we  present  the 
new  technique  on  the  discrete  time  domain  since  all  the 
measurements  are  made  with  digital  equipment.  Let  X(n)  be 
the  discrete  time  domain  representation  of  the  measured  ECG 
signal  of  a  person  with  length  N.  Let  this  signal  be  analyzed 
frame  by  frame.  Let  Xfn)  represents  the  portion  of  the 
original  signal  within  frame  "i".  Then,  we  make  the  following 
"Main  Statement"  which  constitutes  the  basis  of  the  proposed 
technique  in  this  work. 

Main  Statement: 

(a)  For  any  time  frame  "i",  the  measured  ECG  signal  which  is 
given  by  the  vector  X i ,  can  be  expressed  or  approximated  as 
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where 


xi  =  Ci<pja.  (la) 

where  C,  is  a  real  constant,  <pjT  =  [<p] ,  <p2 ,  ¥>3 . <?i  J  is  a  row 

vector.  The  vector  Ci(pi  carries  "almost  maximum  energy" 
of  Xj  in  the  LMS  sense.  That  is  to  say,  Qip,-  is  the  best 
approximation  of  X,  with  one  term  (i.e.  Xt  ~  Cyp, )  that 
minimizes  the  sum  of  square  error  point  by  point,  over  the 
frame  under  consideration. 

The  (Lf  by  LF)  diagonal  matrix  a 


as  an  envelope  term,  which  may  satisfy  the  equality  or  further 
reduce  the  LMS  error  in  (la).  The  integer  LF  designates  the 
total  number  of  elements  in  a  frame  "i". 

(b)  In  continuous  time  domain,  (la)  takes  the  form  of 

Xl(t)  =  Cl<pl(t)al(t)  (lb) 

Based  on  the  main  statement  we  can  make  the  following 
definitions. 


an  0  0.... 

0  ai2 

0  «;  3 


aiL  f 


acts 


Definition  la:  The  vector  (/),  (or  equivalently  sequence  (pt  ) 

is  called  the  Personalized  Signature  Vector  (PSV)  or 
Signature  Sequence  since  it  carries  almost  maximum  energy 
of  the  ECG  frame  vector  Xt  with  a  constant  C,. 

Definition  lb:  In  a  similar  manner  to  that  of  Definition  la, 
the  continuous  time  domain  function  (p/it)  is  called  the 
Personalized  Signature  Function  (PSF). 

Definition  2a:  The  diagonal  matrix  a,  (or  equivalently 
sequence  a,  )  is  called  the  Personalized  Envelope  Matrix 
(PEM)  or  Envelope  Sequence  since  it  matches  the  envelope 
of  Citpi  to  the  original  ECG  frame  vector  Xh 
Definition  2b:  In  a  similar  manner  to  that  of  Definition  2a, 
the  continuous  function  afit)  is  called  the  Personalized 
Envelope  Function  (PEF). 

Definition  3:  The  real  constant  C,  is  called  the  Frame  Scaling 
Coefficient  (FSC). 

In  order  to  verify  the  main  statement,  let  us  proceed  as 
follows. 

A  discrete  ECG  signal  x(n) ,  can  be  written  as 

N 

x(n)  =  X  x;-  .Sj  (n  -  i)  (2a) 

/=! 

In  this  equation,  8 ,(«)  represents  the  unit  sample;  X, 
designates  the  amplitude  of  the  sequence  x(n)  of  length  N. 
x(n)  can  also  be  given  employing  the  vector/matrix  notation. 

XT  =\x(\)  x(2)  x(  /V)]  —  [x]  x2  XN]  (2b) 


X  is  called  the  " Main  Frame  vector"  and  it  is  divided  into 
sub-frames  with  equal  lengths,  having,  for  example,  8,16  or 
32  samples  etc.  The  Frame  Matrix  that  is  represented  by  M F 
is  obtained  by  means  of  the  sub-frame  vectors. 


Mp  = 


Xl  X2 


X 


Nr. 


x(i-l)LF+l 
_  Hi-\)LF+2 

xiLp 

N F  =  N/LF  designates  the  total  number  of  frames  in  X.  It 
can  be  shown  that  each  frame  sequence  or  vector  X,  can  be 
spanned  to  a  vector  space  formed  by  the  orthonormal  vectors 
{ Vki ;  k=l,2,3,..LF },  such  that 

lF 

xi  =  2  ck  X]d  (5) 

k= 1 

ck=(xOTvki  (6) 

are  determined  by  minimizing  the  expected  value  of  the 

error  vector  e—Xi  -  ^ ,ck.Vki  with  respect  to  Vki.  This 

k= 1 

process  is  called  the  "determination  of  Vki  in  the  LMS 
sense”.  Eventually  Vki  are  computed  as  the  Eigenvectors  of 
the  Autocorrelation  Matrix  Rj  of  the  sub-frame  sequence  Xr 
It  is  straightforward  to  obtain  the  autocorrelation  matrix  R,  as 


(4) 

i  =  1,2... Np 


1 

q  (2) 

q(3) 

1 - 

w 

rj  (2) 

rt(  1) 

,-.(2) 

rt{Lp  -1) 

II 

'■j  (3) 

’■j  (2) 

rj  (1) 

’■j(Lp  -2) 

r 

'P  •• 

rj(Lp  -1) 

fiLp-2)  - 

’■j  (1) 

where 

1  l( iLF  J 

q(rf  +  l)= -  r  2  }xj+ixj+X+d  ,  d=0,l,2,..Lp-l  (8) 

Lp  -d  J=W-V)  Lp+\ J  J  J 

It  should  be  noted  that  Rt  is  a  positive-semi  definite,  real 
symmetrical  and  toeplitz  matrix.  The  above-mentioned  LMS 
process  results  in  the  following  eigen-value  problem. 

Rivki=hivki  i  k  =  \,2,...,LF  (9) 

Obviously,  Lki  and  Vki  are  the  eigen-values  and  the 
eigenvectors  of  the  problem  under  consideration.  It  is  well 
known  that  the  eigen-values  of  the  Rt  are  also  real  and  non¬ 


negative.  Moreover,  the  eigenvectors  Vkj  are  all  orthonormal. 
Hence,  (5-6)  follows. 

Let  eigen-values  be  sorted  in  descending  order  such  that 

(%  >l2i  >Xy...>XL  j)  with  corresponding  eigenvectors. 

F 

Total  energy  of  the  sub-frame  "i"  is  given  by  X^.X, 

Lr, 


t  LF  2  LF  2  LF 
Xj  .Xj  =  X  xki  =  E  cki  =  E 
k= 1  A-=l  k= 1 


(10) 


The  eigenvectors  that  have  the  highest  energy  associated 
with  the  highest  value  of  the  eigen-values  represent  the 
directions  of  greatest  variations  of  the  signal  and  they  are  also 
called  the  principle  components.  Therefore,  (5)  may  be 
truncated  by  taking  the  first  "  p "  principal  components, 


which  have  the  highest  energy  of  the  original  signal  such  that 

Xi=Y^ck.Vkl 

k= 1 


(3) 


(11) 


The  simplest  form  of  (10)  can  be  obtained  by  setting 
p=  1 .  The  eigenvector  Vu  is  called  "the  major  signature 
vector" .  That  is  to  say,  the  major  signature  vector,  which  has 
the  highest  energy  in  the  LMS  sense,  may  approximate  each 
frame  that  belongs  to  the  original  ECG  signal.  Thus, 

xt  =  c,y- 1,  (12) 

In  this  case,  the  sub-frame  length  LF  must  be  selected  in 
such  a  way  that  practically  almost  maximum  energy  is 
captured  within  the  first  term  of  (5)  and  the  rest  are 
negligible.  Hence,  (12)  follows.  In  (12)  Vu  is  also  called  the 
signature  sequence  since  it  contains  most  of  the  useful 
information  of  the  original  signal  within  the  sub-frame. 

Once  the  approximation  (12)  is  obtained,  it  can  be 
converted  to  equality  by  means  of  an  "envelope  diagonal 
matrix  A,-  "  for  each  sub-frame.  Thus,  X,  is  computed  as 

Xi=C,VuAi  (13) 

In  (13),  diagonal  components  air  of  the  Matrix  At  is 
computed  in  terms  of  the  components  vUr  of  the  Major 
Signature  vector  V n  and  the  components  xir  of  the  sub-frame 
vector  Xj  by  simple  division  (i.e.  air  =xir/Cj\r  ;r=l,2,...,LF). 

In  this  research  work,  for  a  person,  many  ECG  signals 
were  examined  and  thousands  of  sub-frames  were  analyzed. 
It  has  been  observed  that 

Patterns  obtained  by  plotting  aj(n)=(air  vs.  sub-frame 
index-n=l,2,...,Lf )  and  Vi(n)=(vur  vs.  sub-frame  index- 
n=l,2,...,Lf)  exhibits  repetitive  similarities.  Above-mentioned 
patterns  somewhat  unique  to  a  person.  Therefore,  we  say  that 
ECG  signals  are  unique  to  a  person  and  exhibits  quasi¬ 
stationary  like  behavior.  It  is  deduced  that  similar  patterns 
can  be  eliminated;  and  we  can  create  two  types  of  bases  or 
banks;  namely,  "envelope”  and  "signature"  banks 
respectively  with  reduced  envelope  and  signature  sequences. 

Eventually,  for  any  given  sub-frame  "i",  the  frame 
sequence  X(  can  be  approximated  by  pulling  an  appropriate 
signature  vector  from  the  signature  bank  with  a  constant  c in 
the  LMS  sense  (i.e.  X \  ~c,.Vp).  Reduced  signature  vectors  are 
renamed  under  the  signature  bank  as  { (pn  (») ;  ns=l,2,...,Ns}. 

The  integer  Ns  designates  the  total  number  of  "Signature 
Forms"  in  the  "Signature  Bank".  Similarly,  reduced 
envelope  sequences  or  diagonal  matrices  are  renamed  in  the 
envelope  bank  as  {an  («);  ne=l,2,...,NE}.  The  integer  NE 

designates  the  total  number  of  "Envelope  Forms"  in  the 
"Envelope  Bank". 

Signature  and  envelope  sequences  may  as  well  be 
considered  as  continuous  time  domain  functions,  namely 
signature  functions  { (pn  (t) ;  ns=l,2,...,Ns }  and  envelope 

functions  {an  (t) ;  ne=l,2,...,NE}.  These  continuous 

functions  are  simply  obtained  by  passing  the  corresponding 
sequences  through  a  low-pass  filter. 

Finally,  all  the  above  explanations  suggest  the  phrase  of 
the  main  statement.  Thus,  we  say  that  "any  ECG  signal  frame 
X;  may  be  represented  in  terms  of  the  multiplication  of 
personalized  envelope  OCj(t)  and  signature  tp;(t)  functions  with 
a  constant  Q"  or  Xft)  =  Ci(pi(t)aj(t) 


In  the  following  section,  the  new  ECG  modeling 
algorithms,  which  utilize  predefined  personalized  envelope 
and  signature  banks  is  presented. 

III.  Algorithms 

The  new  modeling  method  is  implemented  within  two 
major  algorithmic  steps.  First  the  predefined  personalized 
envelope  and  signature  banks  (PPFB)  are  generated 
(Algorithm  1).  Then,  any  ECG  signal  is  reconstructed 
utilizing  the  (PPFB)  (Algorithm  2).  Therefore,  in  this  section, 
first  Algorithm  1  is  introduced.  Then  Algorithm  2  is  outlined. 
Algorithm  1:  Generation  of  the  PPFB 
Inputs: 

•  Main  Frame  sequence  of  the  ECG  Signal 

•  Total  Number  of  samples  in  the  sub-frames:  ( LF ) 

Step  1:  Compute  the  total  number  of  frames  NF=N/LF 
Step  2:  Sort  the  Main  frame  X  into  sub -frames  Xt. 

Step  3 :  For  each  sub-frame  X,  compute  the  R, 

Step  4:  For  each  R,  compute  the  eigen-values  Xki  and  the 
corresponding  eigenvectors  Vki. 

Step  5a:  Store  the  eigenvector  Vri  which  is  associated  with  the 
maximum  eigen-value  A„=max  G  h.  u. ...  x,  ■}  and  simply 

I/’  2.1 5  ’  E p^l 

refer  this  vector  with  the  frame  index,  such  as  Vu. 

Step  5b:  Find  the  Frame  Scaling  Coefficient  C,  in  the  LMS 
sense  to  approximate  X-C-,  V,,. 

Step  6:  Do  step  5  for  all  the  sub-frames  ( i=l,2,...,NF •).  At  the 
end  of  this  step,  you  will  be  collecting  the  eigenvectors, 
which  have  the  maximum  energy  for  each  frame. 

Step  7:  Compare  all  the  collected  eigenvectors  in  Step  6,  with 
an  efficient  algorithm,  and  eliminate  the  similar  ones.  Thus, 
generate  the  Signature  Bank  {(pn  (n);  ns=l,2,...,Ns}  with 

reduced  number  of  eigenvectors  Vn.  Here,  Ns  designates  the 
total  number  of  original  signature  patterns  after  the  reduction. 
Step  8:  Compute  the  Diagonal  Envelope  Matrix  H,  =diagonal 
[au,  ai2,...,a,  ]  for  each  C,F,  computed  in  step  5b  such  that 

lLF 

air  =xirlCiv\ir  ’  r=l,2,...,LF). 

Step  9:  As  in  Step  7,  eliminate  the  similar  patterns  of 
envelope  sequences  with  an  efficient  algorithm  and  create  the 
envelope  bank  with  reduced  number  original  sequences 
{ an  («) ;  ne=l,2,...,NE}-,  Here,  NE denotes  the  total  number  of 

original  envelope  patterns  in  the  envelope  bank. 

The  above  algorithm  is  utilized  for  many  different  ECG 
signals  for  a  person  to  enhance  the  formation  of  the 
personalized  signature  and  envelope  banks. 

Once  the  personalized  signature  and  envelope  banks  are 
generated,  then,  any  ECG  signal  can  be  modeled  by  means  of 
PPFB  as  described  in  Algorithm  2. 

Algorithm  2:  Reconstruction  of  ECG  Signals  in  terms  of 

PPFB 

Inputs: 

•  ECG  signal  {X(n),  n=l,2,...,N}  to  be  modeled. 

•  PPFB  created  utilizing  the  algorithm  1 . 

•  LF  :  Number  of  Samples  in  each  sub-frame. 

Step  1 :  Sort  the  sub-frames  X,  as  in  algorithm  1 . 


Step  2: 

(a)  For  each  sub-frame  "i"  pull  an  appropriate  signature 
sequence  < p*  such  that  the  distance  or  the  total  error 


8  = 


xi  -ckVk 


is  minimum  for  all  k=l,2,.. 


.Ng. 


(b)  Store  the  index  number  K  that  refers  to  (pK.  Hence, 
Xi  ~  CKcpK 

Step  3: 

(a)  Pull  the  appropriate  envelope  sequence  (or  diagonal 
matrix)  aR  such  that  the  error  is  further  minimized  for  all 

r=l,2,...,NE.  5R=min{jxi-CK(pKaif};r  =  l,2,-,NE  ■ 

(b)  Store  the  index  number  R  that  refers  to  aR  . 

It  should  be  noted  that  at  the  end  of  this  step,  the  best 
envelope  aR  and  signature  (pK  patterns  are  found  by 
appropriate  selections.  Hence,  the  sub-frame  X,  is  best 
described  in  terms  of  the  patterns  of  aR  and  (pK.  i.e. 
Xt  ~cpKaR. 


Step  4:  Having  fixed  aR  and  (pK,  compute  the  new  frame 
coefficient  C,  to  find  the  global  minimum  of  the  error 

&  Global  ~  min{  Xi  ~^-i(PKaR  P}  and  store  it-  At  this  step,  the 


frame  sequence  is  approximated  as  X,  =  Cj(pKaR 

Step  5:  Repeat  the  above  steps  for  each  frame  to  reconstruct 

the  original  ECG  sequence  X(«) . 

Eventually,  the  above  algorithm  yields  the  frame 
coefficient  C,  and  the  index  numbers  R  and  K  for  the  best 
envelope  and  the  signature  sequences  respectively  for  each 
frame.  Thus,  each  frame  X,  is  referred  by  means  of  the  FSC- 
C,  and  the  indices  R  and  K  with  an  acceptable  error 

^ Global  =  \Xi-Ci(pKaRf . 

Hence,  in  the  reproduction  of  ECG  signals,  substantial 
compression  is  achieved.  Once  the  PPFB  are  stored  in  each 
communication  node,  then  considerable  amount  of  saving  in 
the  transmission  band  will  be  obtained. 

Now,  let  us  see  the  merits  of  the  newly  proposed  ECG 
modeling  technique  with  an  example. 


IV.  Example 


In  this  example,  first,  several  ECG  signal  data  is  recorded 
for  a  person.  Each  recording  sampled  with  500  Hz  and 
contains  2048  samples.  Then,  employing  the  algorithm  1,  we 
generated  envelope  and  signature  sequences  bank.  In  the 
computations,  the  sub-frame  length  NF=  16  was  selected.  It 
was  found  that  with  an  error  8 aobai  <10'3,  we  were  able  to 
reconstruct  all  the  measured  ECG  signals  with  5  original 
signature  (i.e.  Ns=5)  and  315  envelope  (i.e.  ,¥/— 3 1 5 )  patterns. 
Let  us  presume  that  12  bits  represent  each  sample  of  the  sub- 
frame  Xr  Then,  each  frame  of  16  samples  will  be  represented 
with  12x16=192  bit.  On  the  other  hand,  employing  the  newly 
proposed  technique,  each  frame  is  described  by  means  of  C, 
(FSC)  and  two  frame  indices  R  and  K.  In  this  example,  C,  is 
represented  by  12  bits.  R  and  K  are  represented  by  total  of  12 
bits;  then,  24  bits  will  be  good  enough  to  represent  each 
frame.  Thus,  compression  rate  of  192/24=8  is  obtained.  In 


Figure  1,  original  and  reconstructed  signals  are  shown.  As 
can  be  seen  from  Figure  1  that  original  and  reconstructed 
signal  agrees  within  an  error  of  10"3. 


Fig.  1 .  Original  and  Reconstructed  ECG  Signals 

V.  Conclusion 

In  this  paper,  a  novel  method  to  represent  ECG  signals  is 
presented.  The  proposed  technique  is  based  on  the  generation 
of  the  so-called,  predefined  personalized  signature  and 
envelope  banks.  It  has  been  exhibited  that  the  new  method  of 
modeling  ECG  signals,  yields  substantial  compression  and 
results  in  band  saving  in  transmission.  It  is  expected  that 
detail  characterization  and  classification  of  PPFB  by 
disorders  will  lead  to  diagnosis  of  heart  diseases  in  the 
follow-up  research  work. 
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